Dataset statistics
| Number of variables | 56 |
|---|---|
| Number of observations | 319952 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 136.7 MiB |
| Average record size in memory | 448.0 B |
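The memory figures above are mutually consistent if every cell occupies 8 bytes (a 64-bit number or an object pointer). That cell size is an assumption, not something the report states. A quick check using only the counts from the table:

```python
# Assumption: each of the 56 columns stores one 8-byte value per row.
N_ROWS = 319_952
N_COLS = 56
BYTES_PER_CELL = 8  # assumed, not stated in the report

record_bytes = N_COLS * BYTES_PER_CELL        # bytes per observation
total_mib = record_bytes * N_ROWS / 2**20     # whole dataset, in MiB
column_mib = BYTES_PER_CELL * N_ROWS / 2**20  # a single column, in MiB

print(f"{record_bytes} B per record")
print(f"{total_mib:.1f} MiB total")
print(f"{column_mib:.1f} MiB per column")
```

Under that assumption the three results round to the 448.0 B, 136.7 MiB, and 2.4 MiB reported for records, the dataset, and each variable.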
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 41 |
| Text | 7 |
| Alert | Type |
|---|---|
| NPCEP8A is highly imbalanced (61.7%) | Imbalance |
| NPCEP9A is highly imbalanced (59.9%) | Imbalance |
| NPCEP10 is highly imbalanced (58.1%) | Imbalance |
| NPCEP11AA is highly imbalanced (60.4%) | Imbalance |
| NPCEP13 is highly imbalanced (50.3%) | Imbalance |
| NPCEP13A is highly imbalanced (87.9%) | Imbalance |
| NPCEP15 is highly imbalanced (83.8%) | Imbalance |
| NPCEP16A is highly imbalanced (83.0%) | Imbalance |
| NPCEP16B is highly imbalanced (79.5%) | Imbalance |
| NPCEP16C is highly imbalanced (95.6%) | Imbalance |
| NPCEP16D is highly imbalanced (93.0%) | Imbalance |
| NPCEP16E is highly imbalanced (99.1%) | Imbalance |
| NPCEP16F is highly imbalanced (99.3%) | Imbalance |
| NPCEP16G is highly imbalanced (99.8%) | Imbalance |
| NPCEP16H is highly imbalanced (99.7%) | Imbalance |
| NPCEP16I is highly imbalanced (99.9%) | Imbalance |
| NPCEP16J is highly imbalanced (99.5%) | Imbalance |
| NPCEP16K is highly imbalanced (90.4%) | Imbalance |
| NPCEP16AA is highly imbalanced (80.4%) | Imbalance |
| NPCEP16AB is highly imbalanced (92.6%) | Imbalance |
| NPCEP16B1 is highly imbalanced (83.1%) | Imbalance |
| NPCEP19 is highly imbalanced (97.8%) | Imbalance |
| NPCEP21A is highly imbalanced (72.8%) | Imbalance |
| NPCEP22A is highly imbalanced (70.8%) | Imbalance |
| NPCEP24A is highly imbalanced (60.2%) | Imbalance |
| NPCEP25A is highly imbalanced (71.1%) | Imbalance |
| NPCEP27 is highly imbalanced (58.0%) | Imbalance |
| DIRECTORIO_PER has unique values | Unique |
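The imbalance percentages in the alerts appear to be one minus the normalized Shannon entropy of each variable's value counts. The sketch below assumes that definition (it is not spelled out in this report) and applies it to the three NPCEP10 counts listed further down; `imbalance_score` is a name chosen here for illustration.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (bits) of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class leaves nothing to compare
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# NPCEP10 value counts from this report: "1", "2", and a single-space value.
npcep10_counts = [275897, 37126, 6929]
score = imbalance_score(npcep10_counts)
print(f"{score:.1%}")
```

With these counts the score comes out at about 58.1%, which agrees with the NPCEP10 alert. A perfectly even split scores 0%, and the score approaches 100% as one value dominates.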
Reproduction
| Analysis started | 2024-05-07 04:51:03.781078 |
|---|---|
| Analysis finished | 2024-05-07 04:51:37.849049 |
| Duration | 34.07 seconds |
| Software version | ydata-profiling v4.6.5 |
| Download configuration | config.json |
DIRECTORIO_PER
Real number (ℝ)
UNIQUE 
| Distinct | 319952 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19682295 |
| Minimum | 10100011 |
|---|---|
| Maximum | 3.1754311 × 10⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.4 MiB |
Quantile statistics
| Minimum | 10100011 |
|---|---|
| 5-th percentile | 10913314 |
| Q1 | 14570212 |
| median | 18186114 |
| Q3 | 24850312 |
| 95-th percentile | 29416912 |
| Maximum | 3.1754311 × 10⁸ |
| Range | 3.074431 × 10⁸ |
| Interquartile range (IQR) | 10280100 |
Descriptive statistics
| Standard deviation | 7563170.8 |
|---|---|
| Coefficient of variation (CV) | 0.38426264 |
| Kurtosis | 290.38191 |
| Mean | 19682295 |
| Median Absolute Deviation (MAD) | 4765449.5 |
| Skewness | 10.43723 |
| Sum | 6.2973897 × 10¹² |
| Variance | 5.7201552 × 10¹³ |
| Monotonicity | Not monotonic |
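Several of the figures above follow directly from others in the same tables. The sketch below recomputes them for DIRECTORIO_PER using the textbook definitions, which are assumed here and not taken from the profiling tool's source. The exact maximum, 317543112, is read from the extreme-values table for this variable.

```python
# Values copied from the DIRECTORIO_PER tables in this report.
minimum, maximum = 10_100_011, 317_543_112
q1, q3 = 14_570_212, 24_850_312
mean, std = 19_682_295, 7_563_170.8

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std / mean                  # Coefficient of variation (CV)
variance = std ** 2              # Variance

print(value_range, iqr, round(cv, 6), f"{variance:.4e}")
```

The results match the reported range, IQR, CV, and variance up to the rounding of the displayed mean and standard deviation.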
| Value | Count | Frequency (%) |
| 10100011 | 1 | < 0.1% |
| 23193513 | 1 | < 0.1% |
| 23194413 | 1 | < 0.1% |
| 23194412 | 1 | < 0.1% |
| 23194411 | 1 | < 0.1% |
| 23193714 | 1 | < 0.1% |
| 23193713 | 1 | < 0.1% |
| 23193712 | 1 | < 0.1% |
| 23193711 | 1 | < 0.1% |
| 23193512 | 1 | < 0.1% |
| Other values (319942) | 319942 | > 99.9% |
| Value | Count | Frequency (%) |
| 10100011 | 1 | < 0.1% |
| 10100012 | 1 | < 0.1% |
| 10100013 | 1 | < 0.1% |
| 10100111 | 1 | < 0.1% |
| 10100112 | 1 | < 0.1% |
| 10100113 | 1 | < 0.1% |
| 10100114 | 1 | < 0.1% |
| 10100211 | 1 | < 0.1% |
| 10100212 | 1 | < 0.1% |
| 10100311 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 317543112 | 1 | < 0.1% |
| 317543111 | 1 | < 0.1% |
| 317543110 | 1 | < 0.1% |
| 317463110 | 1 | < 0.1% |
| 315231110 | 1 | < 0.1% |
| 315230110 | 1 | < 0.1% |
| 294700110 | 1 | < 0.1% |
| 291262110 | 1 | < 0.1% |
| 287404110 | 1 | < 0.1% |
| 281937110 | 1 | < 0.1% |
DIRECTORIO_HOG
Real number (ℝ)
| Distinct | 109111 |
|---|---|
| Distinct (%) | 34.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1955277.1 |
| Minimum | 1010001 |
|---|---|
| Maximum | 3178851 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.4 MiB |
Quantile statistics
| Minimum | 1010001 |
|---|---|
| 5-th percentile | 1091276.5 |
| Q1 | 1456781 |
| median | 1818101 |
| Q3 | 2483603.5 |
| 95-th percentile | 2940691 |
| Maximum | 3178851 |
| Range | 2168850 |
| Interquartile range (IQR) | 1026822.5 |
Descriptive statistics
| Standard deviation | 586867.83 |
|---|---|
| Coefficient of variation (CV) | 0.30014561 |
| Kurtosis | -1.1519761 |
| Mean | 1955277.1 |
| Median Absolute Deviation (MAD) | 475920 |
| Skewness | 0.26116389 |
| Sum | 6.2559481 × 10¹¹ |
| Variance | 3.4441385 × 10¹¹ |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 1409051 | 17 | < 0.1% |
| 1091071 | 14 | < 0.1% |
| 1857011 | 14 | < 0.1% |
| 1692321 | 14 | < 0.1% |
| 1590861 | 14 | < 0.1% |
| 1120141 | 13 | < 0.1% |
| 2803081 | 13 | < 0.1% |
| 1473411 | 13 | < 0.1% |
| 1294831 | 13 | < 0.1% |
| 1353501 | 13 | < 0.1% |
| Other values (109101) | 319814 | > 99.9% |
| Value | Count | Frequency (%) |
| 1010001 | 3 | < 0.1% |
| 1010011 | 4 | < 0.1% |
| 1010021 | 2 | < 0.1% |
| 1010031 | 3 | < 0.1% |
| 1010041 | 1 | < 0.1% |
| 1010051 | 1 | < 0.1% |
| 1010061 | 1 | < 0.1% |
| 1010071 | 4 | < 0.1% |
| 1010081 | 5 | < 0.1% |
| 1010082 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 3178851 | 2 | < 0.1% |
| 3178811 | 2 | < 0.1% |
| 3178741 | 1 | < 0.1% |
| 3178591 | 1 | < 0.1% |
| 3178441 | 2 | < 0.1% |
| 3178351 | 4 | < 0.1% |
| 3178341 | 2 | < 0.1% |
| 3178321 | 2 | < 0.1% |
| 3178311 | 3 | < 0.1% |
| 3178251 | 5 | < 0.1% |
DIRECTORIO
Real number (ℝ)
| Distinct | 107218 |
|---|---|
| Distinct (%) | 33.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 195527.6 |
| Minimum | 101000 |
|---|---|
| Maximum | 317885 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.4 MiB |
Quantile statistics
| Minimum | 101000 |
|---|---|
| 5-th percentile | 109127.55 |
| Q1 | 145678 |
| median | 181810 |
| Q3 | 248360.25 |
| 95-th percentile | 294069 |
| Maximum | 317885 |
| Range | 216885 |
| Interquartile range (IQR) | 102682.25 |
Descriptive statistics
| Standard deviation | 58686.783 |
|---|---|
| Coefficient of variation (CV) | 0.30014577 |
| Kurtosis | -1.1519761 |
| Mean | 195527.6 |
| Median Absolute Deviation (MAD) | 47592 |
| Skewness | 0.26116388 |
| Sum | 6.2559448 × 10¹⁰ |
| Variance | 3.4441385 × 10⁹ |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 184980 | 22 | < 0.1% |
| 145788 | 22 | < 0.1% |
| 112379 | 21 | < 0.1% |
| 172991 | 19 | < 0.1% |
| 135993 | 19 | < 0.1% |
| 145803 | 18 | < 0.1% |
| 140905 | 17 | < 0.1% |
| 111041 | 17 | < 0.1% |
| 104460 | 16 | < 0.1% |
| 112477 | 16 | < 0.1% |
| Other values (107208) | 319765 | > 99.9% |
| Value | Count | Frequency (%) |
| 101000 | 3 | < 0.1% |
| 101001 | 4 | < 0.1% |
| 101002 | 2 | < 0.1% |
| 101003 | 3 | < 0.1% |
| 101004 | 1 | < 0.1% |
| 101005 | 1 | < 0.1% |
| 101006 | 1 | < 0.1% |
| 101007 | 4 | < 0.1% |
| 101008 | 8 | < 0.1% |
| 101009 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 317885 | 2 | < 0.1% |
| 317881 | 2 | < 0.1% |
| 317874 | 1 | < 0.1% |
| 317859 | 1 | < 0.1% |
| 317844 | 2 | < 0.1% |
| 317835 | 4 | < 0.1% |
| 317834 | 2 | < 0.1% |
| 317832 | 2 | < 0.1% |
| 317831 | 3 | < 0.1% |
| 317825 | 5 | < 0.1% |
SECUENCIA_P
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0194842 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.17583773 |
|---|---|
| Coefficient of variation (CV) | 0.17247716 |
| Kurtosis | 199.18062 |
| Mean | 1.0194842 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.069991 |
| Sum | 326186 |
| Variance | 0.030918908 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 315132 | 98.5% |
| 2 | 3754 | 1.2% |
| 3 | 805 | 0.3% |
| 4 | 199 | 0.1% |
| 5 | 48 | < 0.1% |
| 6 | 7 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 315132 | 98.5% |
| 2 | 3754 | 1.2% |
| 3 | 805 | 0.3% |
| 4 | 199 | 0.1% |
| 5 | 48 | < 0.1% |
| 6 | 7 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 7 | 4 | < 0.1% |
| 6 | 7 | < 0.1% |
| 5 | 48 | < 0.1% |
| 4 | 199 | 0.1% |
| 3 | 805 | 0.3% |
| 2 | 3754 | 1.2% |
| 1 | 315132 | 98.5% |
ORDEN
Real number (ℝ)
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.333525 |
| Minimum | 1 |
|---|---|
| Maximum | 17 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 17 |
| Range | 16 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.3719547 |
|---|---|
| Coefficient of variation (CV) | 0.58793229 |
| Kurtosis | 2.4552737 |
| Mean | 2.333525 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.2729511 |
| Sum | 746616 |
| Variance | 1.8822597 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 109111 | 34.1% |
| 2 | 90132 | 28.2% |
| 3 | 62482 | 19.5% |
| 4 | 35646 | 11.1% |
| 5 | 14103 | 4.4% |
| 6 | 5101 | 1.6% |
| 7 | 1980 | 0.6% |
| 8 | 801 | 0.3% |
| 9 | 339 | 0.1% |
| 10 | 142 | < 0.1% |
| Other values (7) | 115 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 109111 | 34.1% |
| 2 | 90132 | 28.2% |
| 3 | 62482 | 19.5% |
| 4 | 35646 | 11.1% |
| 5 | 14103 | 4.4% |
| 6 | 5101 | 1.6% |
| 7 | 1980 | 0.6% |
| 8 | 801 | 0.3% |
| 9 | 339 | 0.1% |
| 10 | 142 | < 0.1% |
| Value | Count | Frequency (%) |
| 17 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 14 | 5 | < 0.1% |
| 13 | 13 | < 0.1% |
| 12 | 30 | < 0.1% |
| 11 | 64 | < 0.1% |
| 10 | 142 | < 0.1% |
| 9 | 339 | 0.1% |
| 8 | 801 | 0.3% |
NPCEP4
Real number (ℝ)
| Distinct | 108 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.573417 |
| Minimum | 0 |
|---|---|
| Maximum | 107 |
| Zeros | 3101 |
| Zeros (%) | 1.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 18 |
| median | 34 |
| Q3 | 52 |
| 95-th percentile | 73 |
| Maximum | 107 |
| Range | 107 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 21.347928 |
|---|---|
| Coefficient of variation (CV) | 0.60010899 |
| Kurtosis | -0.72245541 |
| Mean | 35.573417 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 0.32342443 |
| Sum | 11381786 |
| Variance | 455.73401 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23 | 5733 | 1.8% |
| 22 | 5725 | 1.8% |
| 25 | 5601 | 1.8% |
| 21 | 5553 | 1.7% |
| 30 | 5534 | 1.7% |
| 20 | 5514 | 1.7% |
| 35 | 5397 | 1.7% |
| 24 | 5373 | 1.7% |
| 27 | 5313 | 1.7% |
| 40 | 5294 | 1.7% |
| Other values (98) | 264915 | 82.8% |
| Value | Count | Frequency (%) |
| 0 | 3101 | 1.0% |
| 1 | 3612 | 1.1% |
| 2 | 3719 | 1.2% |
| 3 | 3742 | 1.2% |
| 4 | 3955 | 1.2% |
| 5 | 3923 | 1.2% |
| 6 | 3924 | 1.2% |
| 7 | 4149 | 1.3% |
| 8 | 4275 | 1.3% |
| 9 | 4370 | 1.4% |
| Value | Count | Frequency (%) |
| 107 | 1 | < 0.1% |
| 106 | 2 | < 0.1% |
| 105 | 1 | < 0.1% |
| 104 | 1 | < 0.1% |
| 103 | 1 | < 0.1% |
| 102 | 3 | < 0.1% |
| 101 | 9 | < 0.1% |
| 100 | 11 | < 0.1% |
| 99 | 20 | < 0.1% |
| 98 | 26 | < 0.1% |
NPCEP5
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 2 | 169263 |
|---|---|
| 1 | 150670 |
| 3 | 19 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 169263 | 52.9% |
| 1 | 150670 | 47.1% |
| 3 | 19 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 169263 | 52.9% |
| 1 | 150670 | 47.1% |
| 3 | 19 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 169263 | 52.9% |
| 1 | 150670 | 47.1% |
| 3 | 19 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 319952 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 169263 | 52.9% |
| 1 | 150670 | 47.1% |
| 3 | 19 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 169263 | 52.9% |
| 1 | 150670 | 47.1% |
| 3 | 19 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 169263 | 52.9% |
| 1 | 150670 | 47.1% |
| 3 | 19 | < 0.1% |
NPCEP6
Real number (ℝ)
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5451661 |
| Minimum | 1 |
|---|---|
| Maximum | 14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 7 |
| Maximum | 14 |
| Range | 13 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.9287395 |
|---|---|
| Coefficient of variation (CV) | 0.75780496 |
| Kurtosis | 11.180038 |
| Mean | 2.5451661 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.8157097 |
| Sum | 814331 |
| Variance | 3.7200362 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 112421 | 35.1% |
| 1 | 109111 | 34.1% |
| 2 | 59928 | 18.7% |
| 4 | 14828 | 4.6% |
| 9 | 5753 | 1.8% |
| 7 | 5614 | 1.8% |
| 5 | 4966 | 1.6% |
| 8 | 2844 | 0.9% |
| 14 | 2495 | 0.8% |
| 6 | 1364 | 0.4% |
| Other values (4) | 628 | 0.2% |
| Value | Count | Frequency (%) |
| 1 | 109111 | 34.1% |
| 2 | 59928 | 18.7% |
| 3 | 112421 | 35.1% |
| 4 | 14828 | 4.6% |
| 5 | 4966 | 1.6% |
| 6 | 1364 | 0.4% |
| 7 | 5614 | 1.8% |
| 8 | 2844 | 0.9% |
| 9 | 5753 | 1.8% |
| 10 | 341 | 0.1% |
| Value | Count | Frequency (%) |
| 14 | 2495 | 0.8% |
| 13 | 204 | 0.1% |
| 12 | 43 | < 0.1% |
| 11 | 40 | < 0.1% |
| 10 | 341 | 0.1% |
| 9 | 5753 | 1.8% |
| 8 | 2844 | 0.9% |
| 7 | 5614 | 1.8% |
| 6 | 1364 | 0.4% |
| 5 | 4966 | 1.6% |
NPCEP7
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 5 | 117639 |
|---|---|
| 6 | 70622 |
| 2 | 53629 |
| 4 | 19256 |
| Other values (2) | 20036 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 6 |
|---|---|
| 2nd row | 6 |
| 3rd row | 5 |
| 4th row | 6 |
| 5th row | 6 |
Common Values
| Value | Count | Frequency (%) |
| 5 | 117639 | 36.8% |
| 6 | 70622 | 22.1% |
| 2 | 53629 | 16.8% |
| (space) | 38770 | 12.1% |
| 4 | 19256 | 6.0% |
| 3 | 13273 | 4.1% |
| 1 | 6763 | 2.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 5 | 117639 | 41.8% |
| 6 | 70622 | 25.1% |
| 2 | 53629 | 19.1% |
| 4 | 19256 | 6.8% |
| 3 | 13273 | 4.7% |
| 1 | 6763 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 117639 | 36.8% |
| 6 | 70622 | 22.1% |
| 2 | 53629 | 16.8% |
| (space) | 38770 | 12.1% |
| 4 | 19256 | 6.0% |
| 3 | 13273 | 4.1% |
| 1 | 6763 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 281182 | 87.9% |
| Space Separator | 38770 | 12.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 117639 | 41.8% |
| 6 | 70622 | 25.1% |
| 2 | 53629 | 19.1% |
| 4 | 19256 | 6.8% |
| 3 | 13273 | 4.7% |
| 1 | 6763 | 2.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 38770 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 117639 | 36.8% |
| 6 | 70622 | 22.1% |
| 2 | 53629 | 16.8% |
| (space) | 38770 | 12.1% |
| 4 | 19256 | 6.0% |
| 3 | 13273 | 4.1% |
| 1 | 6763 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 117639 | 36.8% |
| 6 | 70622 | 22.1% |
| 2 | 53629 | 16.8% |
| (space) | 38770 | 12.1% |
| 4 | 19256 | 6.0% |
| 3 | 13273 | 4.1% |
| 1 | 6763 | 2.1% |
NPCEP8
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 1 | 127161 |
|---|---|
| 2 | 3853 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| (space) | 188938 | 59.1% |
| 1 | 127161 | 39.7% |
| 2 | 3853 | 1.2% |
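The most frequent NPCEP8 value is a single space character (the Space Separator category in the character tables), so the report counts it as a regular category and shows 0 missing cells. If blanks are meant to be treated as missing, they can be counted before profiling. The helper below is a minimal sketch; `count_blank` is a hypothetical name, and the input is the five-row Sample shown for this variable.

```python
def count_blank(values):
    """Count entries that are None or contain only whitespace."""
    return sum(1 for v in values if v is None or str(v).strip() == "")

# First five NPCEP8 rows from the Sample table; the third is a single space.
sample = ["1", "1", " ", "1", "1"]
blank = count_blank(sample)
print(f"{blank} of {len(sample)} values are blank")
```

Whether a blank here means "not applicable" or genuinely missing depends on the questionnaire's skip logic, which this report does not describe.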
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 127161 | 97.1% |
| 2 | 3853 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 188938 | 59.1% |
| 1 | 127161 | 39.7% |
| 2 | 3853 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 188938 | 59.1% |
| Decimal Number | 131014 | 40.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 127161 | 97.1% |
| 2 | 3853 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 188938 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 188938 | 59.1% |
| 1 | 127161 | 39.7% |
| 2 | 3853 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 188938 | 59.1% |
| 1 | 127161 | 39.7% |
| 2 | 3853 | 1.2% |
NPCEP8A
Categorical
IMBALANCE 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 2 | 61224 |
|---|---|
| 1 | 59918 |
| 3 | 2613 |
| 4 | 1752 |
| Other values (10) | 1654 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000813 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319978 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| (space) | 192791 | 60.3% |
| 2 | 61224 | 19.1% |
| 1 | 59918 | 18.7% |
| 3 | 2613 | 0.8% |
| 4 | 1752 | 0.5% |
| 5 | 862 | 0.3% |
| 6 | 440 | 0.1% |
| 7 | 193 | 0.1% |
| 8 | 87 | < 0.1% |
| 9 | 46 | < 0.1% |
| Other values (5) | 26 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| 2 | 61224 | 48.1% |
| 1 | 59918 | 47.1% |
| 3 | 2613 | 2.1% |
| 4 | 1752 | 1.4% |
| 5 | 862 | 0.7% |
| 6 | 440 | 0.3% |
| 7 | 193 | 0.2% |
| 8 | 87 | 0.1% |
| 9 | 46 | < 0.1% |
| 10 | 15 | < 0.1% |
| Other values (4) | 11 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 192791 | 60.3% |
| 2 | 61228 | 19.1% |
| 1 | 59949 | 18.7% |
| 3 | 2613 | 0.8% |
| 4 | 1752 | 0.5% |
| 5 | 863 | 0.3% |
| 6 | 441 | 0.1% |
| 7 | 193 | 0.1% |
| 8 | 87 | < 0.1% |
| 9 | 46 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 192791 | 60.3% |
| Decimal Number | 127187 | 39.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 61228 | 48.1% |
| 1 | 59949 | 47.1% |
| 3 | 2613 | 2.1% |
| 4 | 1752 | 1.4% |
| 5 | 863 | 0.7% |
| 6 | 441 | 0.3% |
| 7 | 193 | 0.2% |
| 8 | 87 | 0.1% |
| 9 | 46 | < 0.1% |
| 0 | 15 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 192791 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319978 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 192791 | 60.3% |
| 2 | 61228 | 19.1% |
| 1 | 59949 | 18.7% |
| 3 | 2613 | 0.8% |
| 4 | 1752 | 0.5% |
| 5 | 863 | 0.3% |
| 6 | 441 | 0.1% |
| 7 | 193 | 0.1% |
| 8 | 87 | < 0.1% |
| 9 | 46 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319978 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 192791 | 60.3% |
| 2 | 61228 | 19.1% |
| 1 | 59949 | 18.7% |
| 3 | 2613 | 0.8% |
| 4 | 1752 | 0.5% |
| 5 | 863 | 0.3% |
| 6 | 441 | 0.1% |
| 7 | 193 | 0.1% |
| 8 | 87 | < 0.1% |
| 9 | 46 | < 0.1% |
NPCEP9
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 2 | 213237 |
|---|---|
| 3 | 99786 |
| 1 | 6929 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 213237 | 66.6% |
| 3 | 99786 | 31.2% |
| 1 | 6929 | 2.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 213237 | 66.6% |
| 3 | 99786 | 31.2% |
| 1 | 6929 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 213237 | 66.6% |
| 3 | 99786 | 31.2% |
| 1 | 6929 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 319952 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 213237 | 66.6% |
| 3 | 99786 | 31.2% |
| 1 | 6929 | 2.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 213237 | 66.6% |
| 3 | 99786 | 31.2% |
| 1 | 6929 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 213237 | 66.6% |
| 3 | 99786 | 31.2% |
| 1 | 6929 | 2.2% |
NPCEP9A
Categorical
IMBALANCE 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 25 | 26980 |
|---|---|
| 15 | 15577 |
| 11 | 10785 |
| 73 | 9775 |
| Other values (29) | 36669 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.311878 |
| Min length | 1 |
Characters and Unicode
| Total characters | 419738 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 15 |
|---|---|
| 2nd row | 15 |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 220166 | 68.8% |
| 25 | 26980 | 8.4% |
| 15 | 15577 | 4.9% |
| 11 | 10785 | 3.4% |
| 73 | 9775 | 3.1% |
| 68 | 5819 | 1.8% |
| 17 | 3218 | 1.0% |
| 05 | 3154 | 1.0% |
| 41 | 3007 | 0.9% |
| 76 | 2886 | 0.9% |
| Other values (24) | 18585 | 5.8% |
Length
| Value | Count | Frequency (%) |
| 25 | 26980 | 27.0% |
| 15 | 15577 | 15.6% |
| 11 | 10785 | 10.8% |
| 73 | 9775 | 9.8% |
| 68 | 5819 | 5.8% |
| 17 | 3218 | 3.2% |
| 05 | 3154 | 3.2% |
| 41 | 3007 | 3.0% |
| 76 | 2886 | 2.9% |
| 50 | 2131 | 2.1% |
| Other values (23) | 16454 | 16.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 220166 | 52.5% |
| 5 | 51289 | 12.2% |
| 1 | 47183 | 11.2% |
| 2 | 31199 | 7.4% |
| 7 | 18425 | 4.4% |
| 3 | 13869 | 3.3% |
| 6 | 11548 | 2.8% |
| 8 | 9366 | 2.2% |
| 0 | 8918 | 2.1% |
| 4 | 6453 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 220166 | 52.5% |
| Decimal Number | 199572 | 47.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 51289 | 25.7% |
| 1 | 47183 | 23.6% |
| 2 | 31199 | 15.6% |
| 7 | 18425 | 9.2% |
| 3 | 13869 | 6.9% |
| 6 | 11548 | 5.8% |
| 8 | 9366 | 4.7% |
| 0 | 8918 | 4.5% |
| 4 | 6453 | 3.2% |
| 9 | 1322 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 220166 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 419738 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 220166 | 52.5% |
| 5 | 51289 | 12.2% |
| 1 | 47183 | 11.2% |
| 2 | 31199 | 7.4% |
| 7 | 18425 | 4.4% |
| 3 | 13869 | 3.3% |
| 6 | 11548 | 2.8% |
| 8 | 9366 | 2.2% |
| 0 | 8918 | 2.1% |
| 4 | 6453 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 419738 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 220166 | 52.5% |
| 5 | 51289 | 12.2% |
| 1 | 47183 | 11.2% |
| 2 | 31199 | 7.4% |
| 7 | 18425 | 4.4% |
| 3 | 13869 | 3.3% |
| 6 | 11548 | 2.8% |
| 8 | 9366 | 2.2% |
| 0 | 8918 | 2.1% |
| 4 | 6453 | 1.5% |
NPCEP9B
Text
| Distinct | 1100 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 2.2475121 |
| Min length | 1 |
Characters and Unicode
| Total characters | 719096 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 36 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 15842 |
|---|---|
| 2nd row | 15842 |
| 3rd row | |
| 4th row | |
| 5th row |
| Value | Count | Frequency (%) |
| 11001 | 10784 | 10.8% |
| 73001 | 2029 | 2.0% |
| 15001 | 1525 | 1.5% |
| 68001 | 1426 | 1.4% |
| 05001 | 1411 | 1.4% |
| 08001 | 1367 | 1.4% |
| 76001 | 1340 | 1.3% |
| 25899 | 1066 | 1.1% |
| 50001 | 1059 | 1.1% |
| 41001 | 1045 | 1.0% |
| Other values (1089) | 76734 | 76.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 220166 | 30.6% |
| 1 | 98025 | 13.6% |
| 0 | 87923 | 12.2% |
| 5 | 72453 | 10.1% |
| 2 | 53700 | 7.5% |
| 7 | 41514 | 5.8% |
| 3 | 37030 | 5.1% |
| 8 | 32816 | 4.6% |
| 6 | 31941 | 4.4% |
| 4 | 26088 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 498930 | 69.4% |
| Space Separator | 220166 | 30.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 98025 | 19.6% |
| 0 | 87923 | 17.6% |
| 5 | 72453 | 14.5% |
| 2 | 53700 | 10.8% |
| 7 | 41514 | 8.3% |
| 3 | 37030 | 7.4% |
| 8 | 32816 | 6.6% |
| 6 | 31941 | 6.4% |
| 4 | 26088 | 5.2% |
| 9 | 17440 | 3.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 220166 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 719096 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 220166 | 30.6% |
| 1 | 98025 | 13.6% |
| 0 | 87923 | 12.2% |
| 5 | 72453 | 10.1% |
| 2 | 53700 | 7.5% |
| 7 | 41514 | 5.8% |
| 3 | 37030 | 5.1% |
| 8 | 32816 | 4.6% |
| 6 | 31941 | 4.4% |
| 4 | 26088 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 719096 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 220166 | 30.6% |
| 1 | 98025 | 13.6% |
| 0 | 87923 | 12.2% |
| 5 | 72453 | 10.1% |
| 2 | 53700 | 7.5% |
| 7 | 41514 | 5.8% |
| 3 | 37030 | 5.1% |
| 8 | 32816 | 4.6% |
| 6 | 31941 | 4.4% |
| 4 | 26088 | 3.6% |
NPCEP10
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 1 | 275897 |
|---|---|
| 2 | 37126 |
| (space) | 6929 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 275897 | 86.2% |
| 2 | 37126 | 11.6% |
| (space) | 6929 | 2.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 275897 | 88.1% |
| 2 | 37126 | 11.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 275897 | 86.2% |
| 2 | 37126 | 11.6% |
| (space) | 6929 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 313023 | 97.8% |
| Space Separator | 6929 | 2.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 275897 | 88.1% |
| 2 | 37126 | 11.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6929 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 275897 | 86.2% |
| 2 | 37126 | 11.6% |
| (space) | 6929 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 275897 | 86.2% |
| 2 | 37126 | 11.6% |
| (space) | 6929 | 2.2% |
NPCEP11A
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 1 | 215200 |
|---|---|
| 2 | 101203 |
| 3 | 3549 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 215200 | 67.3% |
| 2 | 101203 | 31.6% |
| 3 | 3549 | 1.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 215200 | 67.3% |
| 2 | 101203 | 31.6% |
| 3 | 3549 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 215200 | 67.3% |
| 2 | 101203 | 31.6% |
| 3 | 3549 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 319952 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 215200 | 67.3% |
| 2 | 101203 | 31.6% |
| 3 | 3549 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 215200 | 67.3% |
| 2 | 101203 | 31.6% |
| 3 | 3549 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 215200 | 67.3% |
| 2 | 101203 | 31.6% |
| 3 | 3549 | 1.1% |
NPCEP11AA
Categorical
IMBALANCE 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 25 | 29657 |
|---|---|
| 11 | 16579 |
| 15 | 13663 |
| 73 | 8558 |
| Other values (29) | 32746 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.3024235 |
| Min length | 1 |
Characters and Unicode
| Total characters | 416713 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 15 |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 218749 | 68.4% |
| 25 | 29657 | 9.3% |
| 11 | 16579 | 5.2% |
| 15 | 13663 | 4.3% |
| 73 | 8558 | 2.7% |
| 68 | 5066 | 1.6% |
| 17 | 2894 | 0.9% |
| 5 | 2844 | 0.9% |
| 76 | 2623 | 0.8% |
| 41 | 2564 | 0.8% |
| Other values (24) | 16755 | 5.2% |
Length
| Value | Count | Frequency (%) |
| 25 | 29657 | 29.3% |
| 11 | 16579 | 16.4% |
| 15 | 13663 | 13.5% |
| 73 | 8558 | 8.5% |
| 68 | 5066 | 5.0% |
| 17 | 2894 | 2.9% |
| 5 | 2844 | 2.8% |
| 76 | 2623 | 2.6% |
| 41 | 2564 | 2.5% |
| 50 | 2190 | 2.2% |
| Other values (23) | 14565 | 14.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 218749 | 52.5% |
| 1 | 55634 | 13.4% |
| 5 | 51389 | 12.3% |
| 2 | 33322 | 8.0% |
| 7 | 16347 | 3.9% |
| 3 | 12120 | 2.9% |
| 6 | 10216 | 2.5% |
| 8 | 8266 | 2.0% |
| 4 | 5627 | 1.4% |
| 0 | 3885 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 218749 | 52.5% |
| Decimal Number | 197964 | 47.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 55634 | 28.1% |
| 5 | 51389 | 26.0% |
| 2 | 33322 | 16.8% |
| 7 | 16347 | 8.3% |
| 3 | 12120 | 6.1% |
| 6 | 10216 | 5.2% |
| 8 | 8266 | 4.2% |
| 4 | 5627 | 2.8% |
| 0 | 3885 | 2.0% |
| 9 | 1158 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 218749 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 416713 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 218749 | 52.5% |
| 1 | 55634 | 13.4% |
| 5 | 51389 | 12.3% |
| 2 | 33322 | 8.0% |
| 7 | 16347 | 3.9% |
| 3 | 12120 | 2.9% |
| 6 | 10216 | 2.5% |
| 8 | 8266 | 2.0% |
| 4 | 5627 | 1.4% |
| 0 | 3885 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 416713 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 218749 | 52.5% |
| 1 | 55634 | 13.4% |
| 5 | 51389 | 12.3% |
| 2 | 33322 | 8.0% |
| 7 | 16347 | 3.9% |
| 3 | 12120 | 2.9% |
| 6 | 10216 | 2.5% |
| 8 | 8266 | 2.0% |
| 4 | 5627 | 1.4% |
| 0 | 3885 | 0.9% |
NPCEP11AB
Text
| Distinct | 1092 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 2.251344 |
| Min length | 1 |
Characters and Unicode
| Total characters | 720322 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 36 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 15842 |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
| Value | Count | Frequency (%) |
| 11001 | 16579 | 16.4% |
| 25899 | 3098 | 3.1% |
| 25269 | 2113 | 2.1% |
| 73001 | 1810 | 1.8% |
| 25175 | 1455 | 1.4% |
| 15001 | 1394 | 1.4% |
| 8001 | 1269 | 1.3% |
| 5001 | 1266 | 1.3% |
| 25843 | 1239 | 1.2% |
| 68001 | 1231 | 1.2% |
| Other values (1081) | 69749 |
Most occurring characters
| Value | Count | Frequency (%) |
| 218749 | ||
| 1 | 109208 | |
| 0 | 88811 | |
| 5 | 71306 | 9.9% |
| 2 | 54384 | 7.5% |
| 7 | 37917 | 5.3% |
| 3 | 33546 | 4.7% |
| 8 | 31968 | 4.4% |
| 6 | 29599 | 4.1% |
| 4 | 23744 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 501573 | |
| Space Separator | 218749 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 109208 | |
| 0 | 88811 | |
| 5 | 71306 | |
| 2 | 54384 | |
| 7 | 37917 | 7.6% |
| 3 | 33546 | 6.7% |
| 8 | 31968 | 6.4% |
| 6 | 29599 | 5.9% |
| 4 | 23744 | 4.7% |
| 9 | 21090 | 4.2% |
Space Separator
| Value | Count | Frequency (%) |
| 218749 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 720322 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 218749 | ||
| 1 | 109208 | |
| 0 | 88811 | |
| 5 | 71306 | 9.9% |
| 2 | 54384 | 7.5% |
| 7 | 37917 | 5.3% |
| 3 | 33546 | 4.7% |
| 8 | 31968 | 4.4% |
| 6 | 29599 | 4.1% |
| 4 | 23744 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 720322 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 218749 | ||
| 1 | 109208 | |
| 0 | 88811 | |
| 5 | 71306 | 9.9% |
| 2 | 54384 | 7.5% |
| 7 | 37917 | 5.3% |
| 3 | 33546 | 4.7% |
| 8 | 31968 | 4.4% |
| 6 | 29599 | 4.1% |
| 4 | 23744 | 3.3% |
NPCEP11AC
Text
| Distinct | 198 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 1 |
| Mean length | 1.0850159 |
| Min length | 1 |
Characters and Unicode
| Total characters | 347153 |
|---|---|
| Distinct characters | 48 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 113 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
| Value | Count | Frequency (%) |
| venezuela | 2320 | |
| estados | 135 | 3.5% |
| unidos | 134 | 3.5% |
| ecuador | 130 | 3.4% |
| espana | 94 | 2.5% |
| francia | 63 | 1.6% |
| argentina | 55 | 1.4% |
| mexico | 55 | 1.4% |
| alemania | 54 | 1.4% |
| brasil | 50 | 1.3% |
| Other values (183) | 736 | 19.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 316682 | ||
| E | 7823 | 2.3% |
| A | 4075 | 1.2% |
| N | 3028 | 0.9% |
| U | 2898 | 0.8% |
| L | 2676 | 0.8% |
| V | 2393 | 0.7% |
| Z | 2357 | 0.7% |
| I | 809 | 0.2% |
| S | 742 | 0.2% |
| Other values (38) | 3670 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 316682 | |
| Uppercase Letter | 30362 | 8.7% |
| Lowercase Letter | 82 | < 0.1% |
| Control | 10 | < 0.1% |
| Other Punctuation | 8 | < 0.1% |
| Decimal Number | 6 | < 0.1% |
| Dash Punctuation | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 7823 | |
| A | 4075 | |
| N | 3028 | 10.0% |
| U | 2898 | 9.5% |
| L | 2676 | 8.8% |
| V | 2393 | 7.9% |
| Z | 2357 | 7.8% |
| I | 809 | 2.7% |
| S | 742 | 2.4% |
| O | 657 | 2.2% |
| Other values (17) | 2904 | 9.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 16 | |
| e | 13 | |
| u | 8 | |
| l | 7 | |
| i | 7 | |
| n | 7 | |
| s | 6 | 7.3% |
| z | 4 | 4.9% |
| r | 3 | 3.7% |
| o | 3 | 3.7% |
| Other values (4) | 8 |
Control
| Value | Count | Frequency (%) |
| | 9 | |
| | 1 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5 | |
| , | 3 |
Space Separator
| Value | Count | Frequency (%) |
| 316682 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 6 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 316709 | |
| Latin | 30444 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 7823 | |
| A | 4075 | |
| N | 3028 | 9.9% |
| U | 2898 | 9.5% |
| L | 2676 | 8.8% |
| V | 2393 | 7.9% |
| Z | 2357 | 7.7% |
| I | 809 | 2.7% |
| S | 742 | 2.4% |
| O | 657 | 2.2% |
| Other values (31) | 2986 | 9.8% |
Common
| Value | Count | Frequency (%) |
| 316682 | ||
| | 9 | < 0.1% |
| 9 | 6 | < 0.1% |
| . | 5 | < 0.1% |
| - | 3 | < 0.1% |
| , | 3 | < 0.1% |
| | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 347133 | |
| None | 20 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 316682 | ||
| E | 7823 | 2.3% |
| A | 4075 | 1.2% |
| N | 3028 | 0.9% |
| U | 2898 | 0.8% |
| L | 2676 | 0.8% |
| V | 2393 | 0.7% |
| Z | 2357 | 0.7% |
| I | 809 | 0.2% |
| S | 742 | 0.2% |
| Other values (35) | 3650 | 1.1% |
None
| Value | Count | Frequency (%) |
| Ã | 10 | |
| | 9 | |
| | 1 | 5.0% |
NPCEP11
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 1 | |
|---|---|
| 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 234065 | 73.2% |
| 2 | 85887 | 26.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 234065 | |
| 2 | 85887 | 26.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 234065 | |
| 2 | 85887 | 26.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 319952 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 234065 | |
| 2 | 85887 | 26.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 234065 | |
| 2 | 85887 | 26.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 234065 | |
| 2 | 85887 | 26.8% |
NPCEP13
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 2 | |
|---|---|
| 3 | |
| 4 | 2849 |
| 1 | 1422 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 234065 | 73.2% |
| 2 | 56713 | 17.7% |
| 3 | 24903 | 7.8% |
| 4 | 2849 | 0.9% |
| 1 | 1422 | 0.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 56713 | |
| 3 | 24903 | |
| 4 | 2849 | 3.3% |
| 1 | 1422 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 234065 | ||
| 2 | 56713 | 17.7% |
| 3 | 24903 | 7.8% |
| 4 | 2849 | 0.9% |
| 1 | 1422 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 234065 | |
| Decimal Number | 85887 | 26.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 56713 | |
| 3 | 24903 | |
| 4 | 2849 | 3.3% |
| 1 | 1422 | 1.7% |
Space Separator
| Value | Count | Frequency (%) |
| 234065 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 234065 | ||
| 2 | 56713 | 17.7% |
| 3 | 24903 | 7.8% |
| 4 | 2849 | 0.9% |
| 1 | 1422 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 234065 | ||
| 2 | 56713 | 17.7% |
| 3 | 24903 | 7.8% |
| 4 | 2849 | 0.9% |
| 1 | 1422 | 0.4% |
NPCEP13A
Categorical
IMBALANCE 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 11 | 12203 |
|---|---|
| 25 | 4743 |
| 15 | 1053 |
| 73 | 913 |
| Other values (29) | 5991 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0778336 |
| Min length | 1 |
Characters and Unicode
| Total characters | 344855 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 295049 | 92.2% |
| 11 | 12203 | 3.8% |
| 25 | 4743 | 1.5% |
| 15 | 1053 | 0.3% |
| 73 | 913 | 0.3% |
| 05 | 602 | 0.2% |
| 50 | 553 | 0.2% |
| 68 | 528 | 0.2% |
| 76 | 440 | 0.1% |
| 08 | 406 | 0.1% |
| Other values (24) | 3462 | 1.1% |
Length
| Value | Count | Frequency (%) |
| 11 | 12203 | |
| 25 | 4743 | 19.0% |
| 15 | 1053 | 4.2% |
| 73 | 913 | 3.7% |
| 05 | 602 | 2.4% |
| 50 | 553 | 2.2% |
| 68 | 528 | 2.1% |
| 76 | 440 | 1.8% |
| 08 | 406 | 1.6% |
| 41 | 381 | 1.5% |
| Other values (23) | 3081 | 12.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 295049 | ||
| 1 | 26835 | 7.8% |
| 5 | 7636 | 2.2% |
| 2 | 5469 | 1.6% |
| 7 | 2113 | 0.6% |
| 0 | 1933 | 0.6% |
| 3 | 1643 | 0.5% |
| 6 | 1411 | 0.4% |
| 8 | 1381 | 0.4% |
| 4 | 1086 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 295049 | |
| Decimal Number | 49806 | 14.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 26835 | |
| 5 | 7636 | 15.3% |
| 2 | 5469 | 11.0% |
| 7 | 2113 | 4.2% |
| 0 | 1933 | 3.9% |
| 3 | 1643 | 3.3% |
| 6 | 1411 | 2.8% |
| 8 | 1381 | 2.8% |
| 4 | 1086 | 2.2% |
| 9 | 299 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| 295049 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 344855 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 295049 | ||
| 1 | 26835 | 7.8% |
| 5 | 7636 | 2.2% |
| 2 | 5469 | 1.6% |
| 7 | 2113 | 0.6% |
| 0 | 1933 | 0.6% |
| 3 | 1643 | 0.5% |
| 6 | 1411 | 0.4% |
| 8 | 1381 | 0.4% |
| 4 | 1086 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 344855 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 295049 | ||
| 1 | 26835 | 7.8% |
| 5 | 7636 | 2.2% |
| 2 | 5469 | 1.6% |
| 7 | 2113 | 0.6% |
| 0 | 1933 | 0.6% |
| 3 | 1643 | 0.5% |
| 6 | 1411 | 0.4% |
| 8 | 1381 | 0.4% |
| 4 | 1086 | 0.3% |
NPCEP13B
Text
| Distinct | 794 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.3113342 |
| Min length | 1 |
Characters and Unicode
| Total characters | 419564 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 159 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
| Value | Count | Frequency (%) |
| 11001 | 12204 | |
| 05001 | 373 | 1.5% |
| 08001 | 330 | 1.3% |
| 50001 | 320 | 1.3% |
| 73001 | 302 | 1.2% |
| 25126 | 291 | 1.2% |
| 25899 | 267 | 1.1% |
| 76001 | 256 | 1.0% |
| 68001 | 252 | 1.0% |
| 54001 | 246 | 1.0% |
| Other values (783) | 10062 |
Most occurring characters
| Value | Count | Frequency (%) |
| 295049 | ||
| 1 | 45398 | 10.8% |
| 0 | 36333 | 8.7% |
| 5 | 10232 | 2.4% |
| 2 | 8304 | 2.0% |
| 7 | 5440 | 1.3% |
| 3 | 4346 | 1.0% |
| 8 | 4217 | 1.0% |
| 6 | 4128 | 1.0% |
| 4 | 3312 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 295049 | |
| Decimal Number | 124515 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 45398 | |
| 0 | 36333 | |
| 5 | 10232 | 8.2% |
| 2 | 8304 | 6.7% |
| 7 | 5440 | 4.4% |
| 3 | 4346 | 3.5% |
| 8 | 4217 | 3.4% |
| 6 | 4128 | 3.3% |
| 4 | 3312 | 2.7% |
| 9 | 2805 | 2.3% |
Space Separator
| Value | Count | Frequency (%) |
| 295049 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 419564 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 295049 | ||
| 1 | 45398 | 10.8% |
| 0 | 36333 | 8.7% |
| 5 | 10232 | 2.4% |
| 2 | 8304 | 2.0% |
| 7 | 5440 | 1.3% |
| 3 | 4346 | 1.0% |
| 8 | 4217 | 1.0% |
| 6 | 4128 | 1.0% |
| 4 | 3312 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 419564 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 295049 | ||
| 1 | 45398 | 10.8% |
| 0 | 36333 | 8.7% |
| 5 | 10232 | 2.4% |
| 2 | 8304 | 2.0% |
| 7 | 5440 | 1.3% |
| 3 | 4346 | 1.0% |
| 8 | 4217 | 1.0% |
| 6 | 4128 | 1.0% |
| 4 | 3312 | 0.8% |
NPCEP13C
Text
| Distinct | 137 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 1 |
| Mean length | 1.0686259 |
| Min length | 1 |
Characters and Unicode
| Total characters | 341909 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 77 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
| Value | Count | Frequency (%) |
| venezuela | 1957 | |
| estados | 114 | 3.8% |
| unidos | 112 | 3.7% |
| espana | 104 | 3.4% |
| argentina | 55 | 1.8% |
| ecuador | 50 | 1.7% |
| mexico | 41 | 1.4% |
| francia | 38 | 1.3% |
| brasil | 37 | 1.2% |
| colombia | 36 | 1.2% |
| Other values (133) | 476 | 15.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 317274 | ||
| E | 6495 | 1.9% |
| A | 3282 | 1.0% |
| N | 2525 | 0.7% |
| U | 2333 | 0.7% |
| L | 2237 | 0.7% |
| V | 1994 | 0.6% |
| Z | 1992 | 0.6% |
| S | 610 | 0.2% |
| I | 608 | 0.2% |
| Other values (36) | 2559 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 317274 | |
| Uppercase Letter | 24573 | 7.2% |
| Lowercase Letter | 44 | < 0.1% |
| Decimal Number | 6 | < 0.1% |
| Control | 5 | < 0.1% |
| Other Punctuation | 4 | < 0.1% |
| Dash Punctuation | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 6495 | |
| A | 3282 | |
| N | 2525 | 10.3% |
| U | 2333 | 9.5% |
| L | 2237 | 9.1% |
| V | 1994 | 8.1% |
| Z | 1992 | 8.1% |
| S | 610 | 2.5% |
| I | 608 | 2.5% |
| O | 488 | 2.0% |
| Other values (17) | 2009 | 8.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9 | |
| t | 4 | |
| d | 4 | |
| e | 4 | |
| o | 4 | |
| s | 4 | |
| i | 4 | |
| n | 3 | 6.8% |
| l | 3 | 6.8% |
| u | 3 | 6.8% |
| Other values (2) | 2 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 2 | |
| . | 1 | |
| , | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 317274 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 6 |
Control
| Value | Count | Frequency (%) |
| | 5 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 317292 | |
| Latin | 24617 | 7.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 6495 | |
| A | 3282 | |
| N | 2525 | 10.3% |
| U | 2333 | 9.5% |
| L | 2237 | 9.1% |
| V | 1994 | 8.1% |
| Z | 1992 | 8.1% |
| S | 610 | 2.5% |
| I | 608 | 2.5% |
| O | 488 | 2.0% |
| Other values (29) | 2053 | 8.3% |
Common
| Value | Count | Frequency (%) |
| 317274 | ||
| 9 | 6 | < 0.1% |
| | 5 | < 0.1% |
| - | 3 | < 0.1% |
| ? | 2 | < 0.1% |
| . | 1 | < 0.1% |
| , | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 341899 | |
| None | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 317274 | ||
| E | 6495 | 1.9% |
| A | 3282 | 1.0% |
| N | 2525 | 0.7% |
| U | 2333 | 0.7% |
| L | 2237 | 0.7% |
| V | 1994 | 0.6% |
| Z | 1992 | 0.6% |
| S | 610 | 0.2% |
| I | 608 | 0.2% |
| Other values (34) | 2549 | 0.7% |
None
| Value | Count | Frequency (%) |
| | 5 | |
| Ã | 5 |
NPCEP14
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 1 | |
|---|---|
| 2 | 5134 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 238336 | 74.5% |
| 1 | 76482 | 23.9% |
| 2 | 5134 | 1.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 76482 | |
| 2 | 5134 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 238336 | ||
| 1 | 76482 | 23.9% |
| 2 | 5134 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 238336 | |
| Decimal Number | 81616 | 25.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 76482 | |
| 2 | 5134 | 6.3% |
Space Separator
| Value | Count | Frequency (%) |
| 238336 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 238336 | ||
| 1 | 76482 | 23.9% |
| 2 | 5134 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 238336 | ||
| 1 | 76482 | 23.9% |
| 2 | 5134 | 1.6% |
NPCEP15
Categorical
IMBALANCE 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 1 | 10903 |
|---|---|
| 11 | 4043 |
| 2 | 2629 |
| 4 | 2143 |
| Other values (8) | 5185 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0174214 |
| Min length | 1 |
Characters and Unicode
| Total characters | 325526 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 295049 | 92.2% |
| 1 | 10903 | 3.4% |
| 11 | 4043 | 1.3% |
| 2 | 2629 | 0.8% |
| 4 | 2143 | 0.7% |
| 8 | 1131 | 0.4% |
| 10 | 1072 | 0.3% |
| 6 | 912 | 0.3% |
| 3 | 695 | 0.2% |
| 7 | 689 | 0.2% |
| Other values (3) | 686 | 0.2% |
Length
| Value | Count | Frequency (%) |
| 1 | 10903 | |
| 11 | 4043 | 16.2% |
| 2 | 2629 | 10.6% |
| 4 | 2143 | 8.6% |
| 8 | 1131 | 4.5% |
| 10 | 1072 | 4.3% |
| 6 | 912 | 3.7% |
| 3 | 695 | 2.8% |
| 7 | 689 | 2.8% |
| 12 | 459 | 1.8% |
| Other values (2) | 227 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 295049 | ||
| 1 | 20520 | 6.3% |
| 2 | 3088 | 0.9% |
| 4 | 2143 | 0.7% |
| 8 | 1131 | 0.3% |
| 0 | 1072 | 0.3% |
| 6 | 912 | 0.3% |
| 3 | 695 | 0.2% |
| 7 | 689 | 0.2% |
| 9 | 196 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 295049 | |
| Decimal Number | 30477 | 9.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 20520 | |
| 2 | 3088 | 10.1% |
| 4 | 2143 | 7.0% |
| 8 | 1131 | 3.7% |
| 0 | 1072 | 3.5% |
| 6 | 912 | 3.0% |
| 3 | 695 | 2.3% |
| 7 | 689 | 2.3% |
| 9 | 196 | 0.6% |
| 5 | 31 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 295049 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 325526 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 295049 | ||
| 1 | 20520 | 6.3% |
| 2 | 3088 | 0.9% |
| 4 | 2143 | 0.7% |
| 8 | 1131 | 0.3% |
| 0 | 1072 | 0.3% |
| 6 | 912 | 0.3% |
| 3 | 695 | 0.2% |
| 7 | 689 | 0.2% |
| 9 | 196 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 325526 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 295049 | ||
| 1 | 20520 | 6.3% |
| 2 | 3088 | 0.9% |
| 4 | 2143 | 0.7% |
| 8 | 1131 | 0.3% |
| 0 | 1072 | 0.3% |
| 6 | 912 | 0.3% |
| 3 | 695 | 0.2% |
| 7 | 689 | 0.2% |
| 9 | 196 | 0.1% |
NPCEP16A
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 1 | 8084 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 311868 | 97.5% |
| 1 | 8084 | 2.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 8084 |
Most occurring characters
| Value | Count | Frequency (%) |
| 311868 | ||
| 1 | 8084 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 311868 | |
| Decimal Number | 8084 | 2.5% |
Most frequent character per category
Space Separator
| Value | Count | Frequency (%) |
| 311868 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8084 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 311868 | ||
| 1 | 8084 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 311868 | ||
| 1 | 8084 | 2.5% |
NPCEP16B
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 1 | 10255 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 309697 | 96.8% |
| 1 | 10255 | 3.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 10255 |
Most occurring characters
| Value | Count | Frequency (%) |
| 309697 | ||
| 1 | 10255 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 309697 | |
| Decimal Number | 10255 | 3.2% |
Most frequent character per category
Space Separator
| Value | Count | Frequency (%) |
| 309697 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 10255 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 309697 | ||
| 1 | 10255 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 309697 | ||
| 1 | 10255 | 3.2% |
NPCEP16C
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 1 | 1527 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 318425 | 99.5% |
| 1 | 1527 | 0.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 1527 |
Most occurring characters
| Value | Count | Frequency (%) |
| 318425 | ||
| 1 | 1527 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 318425 | |
| Decimal Number | 1527 | 0.5% |
Most frequent character per category
Space Separator
| Value | Count | Frequency (%) |
| 318425 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1527 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 318425 | ||
| 1 | 1527 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 318425 | ||
| 1 | 1527 | 0.5% |
NPCEP16D
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 1 | 2707 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 317245 | 99.2% |
| 1 | 2707 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 2707 |
Most occurring characters
| Value | Count | Frequency (%) |
| 317245 | ||
| 1 | 2707 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 317245 | |
| Decimal Number | 2707 | 0.8% |
Most frequent character per category
Space Separator
| Value | Count | Frequency (%) |
| 317245 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2707 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 317245 | ||
| 1 | 2707 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 317245 | ||
| 1 | 2707 | 0.8% |
NPCEP16E
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 1 | 258 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 319694 | 99.9% |
| 1 | 258 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 258 |
Most occurring characters
| Value | Count | Frequency (%) |
| 319694 | ||
| 1 | 258 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 319694 | |
| Decimal Number | 258 | 0.1% |
Most frequent character per category
Space Separator
| Value | Count | Frequency (%) |
| 319694 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 258 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 319694 | ||
| 1 | 258 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 319694 | ||
| 1 | 258 | 0.1% |
NPCEP16F
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 1 | 174 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 319778 | 99.9% |
| 1 | 174 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 174 |
Most occurring characters
| Value | Count | Frequency (%) |
| 319778 | ||
| 1 | 174 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 319778 | |
| Decimal Number | 174 | 0.1% |
Most frequent character per category
Space Separator
| Value | Count | Frequency (%) |
| 319778 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 174 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 319778 | ||
| 1 | 174 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 319778 | ||
| 1 | 174 | 0.1% |
NPCEP16G
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 319902 | > 99.9% |
| 1 | 50 | < 0.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 50 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 319902 | > 99.9% |
| 1 | 50 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Space Separator | 319902 | > 99.9% |
| Decimal Number | 50 | < 0.1% |
Most frequent character per category
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 319902 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 50 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 319952 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 319952 | 100.0% |
NPCEP16H
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 319876 | > 99.9% |
| 1 | 76 | < 0.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 76 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 319876 | > 99.9% |
| 1 | 76 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Space Separator | 319876 | > 99.9% |
| Decimal Number | 76 | < 0.1% |
Most frequent character per category
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 319876 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 76 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 319952 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 319952 | 100.0% |
NPCEP16I
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 319942 | > 99.9% |
| 1 | 10 | < 0.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 10 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 319942 | > 99.9% |
| 1 | 10 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Space Separator | 319942 | > 99.9% |
| Decimal Number | 10 | < 0.1% |
Most frequent character per category
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 319942 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 10 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 319952 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 319952 | 100.0% |
NPCEP16J
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 319818 | > 99.9% |
| 1 | 134 | < 0.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 134 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 319818 | > 99.9% |
| 1 | 134 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Space Separator | 319818 | > 99.9% |
| Decimal Number | 134 | < 0.1% |
Most frequent character per category
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 319818 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 134 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 319952 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 319952 | 100.0% |
NPCEP16K
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 316003 | 98.8% |
| 1 | 3949 | 1.2% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 3949 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 316003 | 98.8% |
| 1 | 3949 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Space Separator | 316003 | 98.8% |
| Decimal Number | 3949 | 1.2% |
Most frequent character per category
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 316003 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 3949 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 319952 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 319952 | 100.0% |
NPCEP16A1
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 249324 | 77.9% |
| (blank) | 45918 | 14.4% |
| 2 | 24710 | 7.7% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 249324 | 91.0% |
| 2 | 24710 | 9.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 249324 | 77.9% |
| (space) | 45918 | 14.4% |
| 2 | 24710 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 274034 | 85.6% |
| Space Separator | 45918 | 14.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 249324 | 91.0% |
| 2 | 24710 | 9.0% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 45918 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 319952 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 319952 | 100.0% |
NPCEP16AA
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 305250 | 95.4% |
| 1 | 10128 | 3.2% |
| 2 | 4574 | 1.4% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 10128 | 68.9% |
| 2 | 4574 | 31.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 305250 | 95.4% |
| 1 | 10128 | 3.2% |
| 2 | 4574 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Space Separator | 305250 | 95.4% |
| Decimal Number | 14702 | 4.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 10128 | 68.9% |
| 2 | 4574 | 31.1% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 305250 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 319952 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 319952 | 100.0% |
NPCEP16AB
Categorical
IMBALANCE 
| Distinct | 22 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0164431 |
| Min length | 1 |
Characters and Unicode
| Total characters | 325213 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 309824 | 96.8% |
| 11 | 1273 | 0.4% |
| 8 | 1233 | 0.4% |
| 10 | 934 | 0.3% |
| 7 | 729 | 0.2% |
| 19 | 661 | 0.2% |
| 9 | 606 | 0.2% |
| 18 | 580 | 0.2% |
| 4 | 547 | 0.2% |
| 2 | 500 | 0.2% |
| Other values (12) | 3065 | 1.0% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 11 | 1273 | 12.6% |
| 8 | 1233 | 12.2% |
| 10 | 934 | 9.2% |
| 7 | 729 | 7.2% |
| 19 | 661 | 6.5% |
| 9 | 606 | 6.0% |
| 18 | 580 | 5.7% |
| 4 | 547 | 5.4% |
| 2 | 500 | 4.9% |
| 1 | 495 | 4.9% |
| Other values (11) | 2570 | 25.4% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 309824 | 95.3% |
| 1 | 6758 | 2.1% |
| 8 | 1813 | 0.6% |
| 9 | 1771 | 0.5% |
| 0 | 953 | 0.3% |
| 7 | 863 | 0.3% |
| 2 | 825 | 0.3% |
| 4 | 740 | 0.2% |
| 6 | 738 | 0.2% |
| 5 | 561 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Space Separator | 309824 | 95.3% |
| Decimal Number | 15389 | 4.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 6758 | 43.9% |
| 8 | 1813 | 11.8% |
| 9 | 1771 | 11.5% |
| 0 | 953 | 6.2% |
| 7 | 863 | 5.6% |
| 2 | 825 | 5.4% |
| 4 | 740 | 4.8% |
| 6 | 738 | 4.8% |
| 5 | 561 | 3.6% |
| 3 | 367 | 2.4% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 309824 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 325213 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 325213 | 100.0% |
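The Length and Characters figures for NPCEP16AB seem to be tied together by simple arithmetic: mean length looks like total characters divided by the number of observations. The sketch below checks that reading with the values printed above; the variable names are illustrative, and the relation is an inference from the numbers rather than a documented definition.

```python
# Figures copied from the NPCEP16AB tables and the dataset overview
n_rows = 319952        # number of observations
total_chars = 325213   # "Total characters" for NPCEP16AB

# Assumed relation: mean length = total characters / observations
mean_length = total_chars / n_rows

# Every value has length 1 or 2 (min length 1, max length 2), so each
# character beyond one per row must come from a two-character code.
two_char_rows = total_chars - n_rows

print(f"mean length   = {mean_length:.7f}")
print(f"two-char rows = {two_char_rows}")
```

The same check can be repeated for any column whose lengths are limited to 1 and 2, by swapping in its own total character count.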
NPCEP16B1
Categorical
IMBALANCE 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.018159 |
| Min length | 1 |
Characters and Unicode
| Total characters | 325762 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 295242 | 92.3% |
| 8 | 5859 | 1.8% |
| 1 | 3708 | 1.2% |
| 7 | 3706 | 1.2% |
| 4 | 3386 | 1.1% |
| 11 | 3344 | 1.0% |
| 10 | 1831 | 0.6% |
| 2 | 987 | 0.3% |
| 12 | 635 | 0.2% |
| 3 | 464 | 0.1% |
| Other values (3) | 790 | 0.2% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 8 | 5859 | 23.7% |
| 1 | 3708 | 15.0% |
| 7 | 3706 | 15.0% |
| 4 | 3386 | 13.7% |
| 11 | 3344 | 13.5% |
| 10 | 1831 | 7.4% |
| 2 | 987 | 4.0% |
| 12 | 635 | 2.6% |
| 3 | 464 | 1.9% |
| 9 | 379 | 1.5% |
| Other values (2) | 411 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 295242 | 90.6% |
| 1 | 12862 | 3.9% |
| 8 | 5859 | 1.8% |
| 7 | 3706 | 1.1% |
| 4 | 3386 | 1.0% |
| 0 | 1831 | 0.6% |
| 2 | 1622 | 0.5% |
| 3 | 464 | 0.1% |
| 9 | 379 | 0.1% |
| 6 | 336 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Space Separator | 295242 | 90.6% |
| Decimal Number | 30520 | 9.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 12862 | 42.1% |
| 8 | 5859 | 19.2% |
| 7 | 3706 | 12.1% |
| 4 | 3386 | 11.1% |
| 0 | 1831 | 6.0% |
| 2 | 1622 | 5.3% |
| 3 | 464 | 1.5% |
| 9 | 379 | 1.2% |
| 6 | 336 | 1.1% |
| 5 | 75 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 295242 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 325762 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 325762 | 100.0% |
NPCEP17
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.9781405 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 6 |
| median | 6 |
| Q3 | 6 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.28431428 |
|---|---|
| Coefficient of variation (CV) | 0.047558983 |
| Kurtosis | 266.34015 |
| Mean | 5.9781405 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -15.908514 |
| Sum | 1912718 |
| Variance | 0.08083461 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 316907 | 99.0% |
| 5 | 1973 | 0.6% |
| 1 | 856 | 0.3% |
| 2 | 117 | < 0.1% |
| 3 | 75 | < 0.1% |
| 4 | 24 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 856 | 0.3% |
| 2 | 117 | < 0.1% |
| 3 | 75 | < 0.1% |
| 4 | 24 | < 0.1% |
| 5 | 1973 | 0.6% |
| 6 | 316907 | 99.0% |

| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 316907 | 99.0% |
| 5 | 1973 | 0.6% |
| 4 | 24 | < 0.1% |
| 3 | 75 | < 0.1% |
| 2 | 117 | < 0.1% |
| 1 | 856 | 0.3% |
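The summary statistics reported for NPCEP17 can be cross-checked against its frequency table. The sketch below recomputes the sum, mean, variance, standard deviation and coefficient of variation from the six value counts; the use of the sample variance (an n - 1 denominator) is an assumption about how the report computes it, and the variable names are illustrative.

```python
import math

# value -> count, copied from the NPCEP17 frequency table
freq = {6: 316907, 5: 1973, 1: 856, 2: 117, 3: 75, 4: 24}

n = sum(freq.values())                         # number of observations
total = sum(v * c for v, c in freq.items())    # "Sum" in the report
mean = total / n

# Assumption: sample variance with an (n - 1) denominator
variance = sum(c * (v - mean) ** 2 for v, c in freq.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean                                # coefficient of variation

print(f"n={n} sum={total} mean={mean:.7f}")
print(f"variance={variance:.8f} std={std:.8f} cv={cv:.9f}")
```

With over 99% of rows equal to 6, the mean sits just below 6 and the IQR and MAD collapse to 0, which is why the skewness and kurtosis reported above are so extreme.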
NPCEP18
Text
| Distinct | 103 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 1 |
| Mean length | 1.013377 |
| Min length | 1 |
Characters and Unicode
| Total characters | 324232 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 40 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
| Value | Count | Frequency (%) |
|---|---|---|
| 470_01 | 236 | 27.6% |
| 999_01 | 97 | 11.3% |
| 200_01 | 80 | 9.3% |
| 470_02 | 61 | 7.1% |
| 840_01 | 33 | 3.9% |
| 500_03 | 29 | 3.4% |
| 650_01 | 20 | 2.3% |
| 340_01 | 18 | 2.1% |
| 500_01 | 17 | 2.0% |
| 560_01 | 15 | 1.8% |
| Other values (92) | 250 | 29.2% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 319096 | 98.4% |
| 0 | 1798 | 0.6% |
| _ | 856 | 0.3% |
| 1 | 668 | 0.2% |
| 4 | 395 | 0.1% |
| 7 | 358 | 0.1% |
| 9 | 315 | 0.1% |
| 2 | 308 | 0.1% |
| 5 | 140 | < 0.1% |
| 3 | 121 | < 0.1% |
| Other values (2) | 177 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Space Separator | 319096 | 98.4% |
| Decimal Number | 4280 | 1.3% |
| Connector Punctuation | 856 | 0.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1798 | 42.0% |
| 1 | 668 | 15.6% |
| 4 | 395 | 9.2% |
| 7 | 358 | 8.4% |
| 9 | 315 | 7.4% |
| 2 | 308 | 7.2% |
| 5 | 140 | 3.3% |
| 3 | 121 | 2.8% |
| 8 | 104 | 2.4% |
| 6 | 73 | 1.7% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 319096 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| _ | 856 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 324232 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 324232 | 100.0% |
NPCEP19
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 318880 | 99.7% |
| 2 | 745 | 0.2% |
| 1 | 327 | 0.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 745 | 69.5% |
| 1 | 327 | 30.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 318880 | 99.7% |
| 2 | 745 | 0.2% |
| 1 | 327 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Space Separator | 318880 | 99.7% |
| Decimal Number | 1072 | 0.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 745 | 69.5% |
| 1 | 327 | 30.5% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 318880 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 319952 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 319952 | 100.0% |
NPCEP21
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 149135 | 46.6% |
| 3 | 93523 | 29.2% |
| 1 | 77294 | 24.2% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 149135 | 46.6% |
| 3 | 93523 | 29.2% |
| 1 | 77294 | 24.2% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 149135 | 46.6% |
| 3 | 93523 | 29.2% |
| 1 | 77294 | 24.2% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 319952 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 149135 | 46.6% |
| 3 | 93523 | 29.2% |
| 1 | 77294 | 24.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 319952 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 319952 | 100.0% |
NPCEP21A
Categorical
IMBALANCE 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000344 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319963 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | 1 |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 242654 | 75.8% |
| 1 | 61910 | 19.3% |
| 2 | 11420 | 3.6% |
| 3 | 1806 | 0.6% |
| 4 | 1094 | 0.3% |
| 5 | 603 | 0.2% |
| 6 | 252 | 0.1% |
| 7 | 113 | < 0.1% |
| 8 | 55 | < 0.1% |
| 9 | 34 | < 0.1% |
| Other values (4) | 11 | < 0.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 61910 | 80.1% |
| 2 | 11420 | 14.8% |
| 3 | 1806 | 2.3% |
| 4 | 1094 | 1.4% |
| 5 | 603 | 0.8% |
| 6 | 252 | 0.3% |
| 7 | 113 | 0.1% |
| 8 | 55 | 0.1% |
| 9 | 34 | < 0.1% |
| 10 | 7 | < 0.1% |
| Other values (3) | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 242654 | 75.8% |
| 1 | 61923 | 19.4% |
| 2 | 11421 | 3.6% |
| 3 | 1806 | 0.6% |
| 4 | 1094 | 0.3% |
| 5 | 604 | 0.2% |
| 6 | 252 | 0.1% |
| 7 | 113 | < 0.1% |
| 8 | 55 | < 0.1% |
| 9 | 34 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Space Separator | 242654 | 75.8% |
| Decimal Number | 77309 | 24.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 61923 | 80.1% |
| 2 | 11421 | 14.8% |
| 3 | 1806 | 2.3% |
| 4 | 1094 | 1.4% |
| 5 | 604 | 0.8% |
| 6 | 252 | 0.3% |
| 7 | 113 | 0.1% |
| 8 | 55 | 0.1% |
| 9 | 34 | < 0.1% |
| 0 | 7 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 242654 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 319963 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 319963 | 100.0% |
NPCEP22
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.2092189 |
| Min length | 1 |
Characters and Unicode
| Total characters | 386892 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 77294 | 24.2% |
| 2 | 76824 | 24.0% |
| 99 | 45019 | 14.1% |
| 4 | 38640 | 12.1% |
| 1 | 28756 | 9.0% |
| 10 | 21921 | 6.9% |
| 8 | 13574 | 4.2% |
| 3 | 8085 | 2.5% |
| 6 | 5112 | 1.6% |
| 9 | 2123 | 0.7% |
| Other values (2) | 2604 | 0.8% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 76824 | 31.7% |
| 99 | 45019 | 18.6% |
| 4 | 38640 | 15.9% |
| 1 | 28756 | 11.9% |
| 10 | 21921 | 9.0% |
| 8 | 13574 | 5.6% |
| 3 | 8085 | 3.3% |
| 6 | 5112 | 2.1% |
| 9 | 2123 | 0.9% |
| 5 | 1403 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 9 | 92161 | 23.8% |
| (space) | 77294 | 20.0% |
| 2 | 76824 | 19.9% |
| 1 | 50677 | 13.1% |
| 4 | 38640 | 10.0% |
| 0 | 21921 | 5.7% |
| 8 | 13574 | 3.5% |
| 3 | 8085 | 2.1% |
| 6 | 5112 | 1.3% |
| 5 | 1403 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 309598 | 80.0% |
| Space Separator | 77294 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 9 | 92161 | 29.8% |
| 2 | 76824 | 24.8% |
| 1 | 50677 | 16.4% |
| 4 | 38640 | 12.5% |
| 0 | 21921 | 7.1% |
| 8 | 13574 | 4.4% |
| 3 | 8085 | 2.6% |
| 6 | 5112 | 1.7% |
| 5 | 1403 | 0.5% |
| 7 | 1201 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 77294 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 386892 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 386892 | 100.0% |
NPCEP22A
Categorical
IMBALANCE 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000031 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319953 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 2 |
| 3rd row | |
| 4th row | 4 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 278384 | 87.0% |
| 3 | 14521 | 4.5% |
| 2 | 13257 | 4.1% |
| 4 | 7191 | 2.2% |
| 1 | 5302 | 1.7% |
| 5 | 1296 | 0.4% |
| 10 | 1 | < 0.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 14521 | 34.9% |
| 2 | 13257 | 31.9% |
| 4 | 7191 | 17.3% |
| 1 | 5302 | 12.8% |
| 5 | 1296 | 3.1% |
| 10 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 278384 | 87.0% |
| 3 | 14521 | 4.5% |
| 2 | 13257 | 4.1% |
| 4 | 7191 | 2.2% |
| 1 | 5303 | 1.7% |
| 5 | 1296 | 0.4% |
| 0 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Space Separator | 278384 | 87.0% |
| Decimal Number | 41569 | 13.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 14521 | 34.9% |
| 2 | 13257 | 31.9% |
| 4 | 7191 | 17.3% |
| 1 | 5303 | 12.8% |
| 5 | 1296 | 3.1% |
| 0 | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 278384 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 319953 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 319953 | 100.0% |
NPCEP24
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 127967 | 40.0% |
| 1 | 124708 | 39.0% |
| 3 | 67277 | 21.0% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 127967 | 40.0% |
| 1 | 124708 | 39.0% |
| 3 | 67277 | 21.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 127967 | 40.0% |
| 1 | 124708 | 39.0% |
| 3 | 67277 | 21.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 319952 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 127967 | 40.0% |
| 1 | 124708 | 39.0% |
| 3 | 67277 | 21.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 319952 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 319952 | 100.0% |
NPCEP24A
Categorical
IMBALANCE 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0001344 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319995 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | 2 |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 195240 | 61.0% |
| 2 | 68542 | 21.4% |
| 1 | 44105 | 13.8% |
| 3 | 6180 | 1.9% |
| 4 | 3053 | 1.0% |
| 5 | 1666 | 0.5% |
| 6 | 717 | 0.2% |
| 7 | 253 | 0.1% |
| 8 | 100 | < 0.1% |
| 9 | 53 | < 0.1% |
| Other values (5) | 43 | < 0.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 68542 | |
| 1 | 44105 | |
| 3 | 6180 | 5.0% |
| 4 | 3053 | 2.4% |
| 5 | 1666 | 1.3% |
| 6 | 717 | 0.6% |
| 7 | 253 | 0.2% |
| 8 | 100 | 0.1% |
| 9 | 53 | < 0.1% |
| 10 | 24 | < 0.1% |
| Other values (4) | 19 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 195240 | ||
| 2 | 68546 | 21.4% |
| 1 | 44160 | 13.8% |
| 3 | 6182 | 1.9% |
| 4 | 3053 | 1.0% |
| 5 | 1666 | 0.5% |
| 6 | 718 | 0.2% |
| 7 | 253 | 0.1% |
| 8 | 100 | < 0.1% |
| 9 | 53 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 195240 | |
| Decimal Number | 124755 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 68546 | |
| 1 | 44160 | |
| 3 | 6182 | 5.0% |
| 4 | 3053 | 2.4% |
| 5 | 1666 | 1.3% |
| 6 | 718 | 0.6% |
| 7 | 253 | 0.2% |
| 8 | 100 | 0.1% |
| 9 | 53 | < 0.1% |
| 0 | 24 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 195240 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319995 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 195240 | ||
| 2 | 68546 | 21.4% |
| 1 | 44160 | 13.8% |
| 3 | 6182 | 1.9% |
| 4 | 3053 | 1.0% |
| 5 | 1666 | 0.5% |
| 6 | 718 | 0.2% |
| 7 | 253 | 0.1% |
| 8 | 100 | < 0.1% |
| 9 | 53 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319995 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 195240 | ||
| 2 | 68546 | 21.4% |
| 1 | 44160 | 13.8% |
| 3 | 6182 | 1.9% |
| 4 | 3053 | 1.0% |
| 5 | 1666 | 0.5% |
| 6 | 718 | 0.2% |
| 7 | 253 | 0.1% |
| 8 | 100 | < 0.1% |
| 9 | 53 | < 0.1% |
NPCEP25
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 2 | 68734 |
|---|---|
| 4 | 29534 |
| 99 | 27876 |
| 1 | 27267 |
| Other values (7) | 41833 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.1541231 |
| Min length | 1 |
Characters and Unicode
| Total characters | 369264 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 10 |
| 3rd row | |
| 4th row | 3 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 124708 | 39.0% |
| 2 | 68734 | 21.5% |
| 4 | 29534 | 9.2% |
| 99 | 27876 | 8.7% |
| 1 | 27267 | 8.5% |
| 10 | 21436 | 6.7% |
| 8 | 7094 | 2.2% |
| 3 | 6607 | 2.1% |
| 6 | 3822 | 1.2% |
| 9 | 1292 | 0.4% |
| Other values (2) | 1582 | 0.5% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 68734 | |
| 4 | 29534 | |
| 99 | 27876 | |
| 1 | 27267 | 14.0% |
| 10 | 21436 | 11.0% |
| 8 | 7094 | 3.6% |
| 3 | 6607 | 3.4% |
| 6 | 3822 | 2.0% |
| 9 | 1292 | 0.7% |
| 5 | 1021 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 124708 | ||
| 2 | 68734 | |
| 9 | 57044 | |
| 1 | 48703 | 13.2% |
| 4 | 29534 | 8.0% |
| 0 | 21436 | 5.8% |
| 8 | 7094 | 1.9% |
| 3 | 6607 | 1.8% |
| 6 | 3822 | 1.0% |
| 5 | 1021 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 244556 | |
| Space Separator | 124708 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 68734 | |
| 9 | 57044 | |
| 1 | 48703 | |
| 4 | 29534 | |
| 0 | 21436 | 8.8% |
| 8 | 7094 | 2.9% |
| 3 | 6607 | 2.7% |
| 6 | 3822 | 1.6% |
| 5 | 1021 | 0.4% |
| 7 | 561 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 124708 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 369264 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 124708 | ||
| 2 | 68734 | |
| 9 | 57044 | |
| 1 | 48703 | 13.2% |
| 4 | 29534 | 8.0% |
| 0 | 21436 | 5.8% |
| 8 | 7094 | 1.9% |
| 3 | 6607 | 1.8% |
| 6 | 3822 | 1.0% |
| 5 | 1021 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 369264 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 124708 | ||
| 2 | 68734 | |
| 9 | 57044 | |
| 1 | 48703 | 13.2% |
| 4 | 29534 | 8.0% |
| 0 | 21436 | 5.8% |
| 8 | 7094 | 1.9% |
| 3 | 6607 | 1.8% |
| 6 | 3822 | 1.0% |
| 5 | 1021 | 0.3% |
NPCEP25A
Categorical
IMBALANCE 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 3 | 12676 |
|---|---|
| 2 | 11744 |
| 4 | 6310 |
| 1 | 5109 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | 4 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 283204 | 88.5% |
| 3 | 12676 | 4.0% |
| 2 | 11744 | 3.7% |
| 4 | 6310 | 2.0% |
| 1 | 5109 | 1.6% |
| 5 | 909 | 0.3% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 12676 | |
| 2 | 11744 | |
| 4 | 6310 | |
| 1 | 5109 | |
| 5 | 909 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 283204 | ||
| 3 | 12676 | 4.0% |
| 2 | 11744 | 3.7% |
| 4 | 6310 | 2.0% |
| 1 | 5109 | 1.6% |
| 5 | 909 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 283204 | |
| Decimal Number | 36748 | 11.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 12676 | |
| 2 | 11744 | |
| 4 | 6310 | |
| 1 | 5109 | |
| 5 | 909 | 2.5% |
Space Separator
| Value | Count | Frequency (%) |
| 283204 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 283204 | ||
| 3 | 12676 | 4.0% |
| 2 | 11744 | 3.7% |
| 4 | 6310 | 2.0% |
| 1 | 5109 | 1.6% |
| 5 | 909 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 283204 | ||
| 3 | 12676 | 4.0% |
| 2 | 11744 | 3.7% |
| 4 | 6310 | 2.0% |
| 1 | 5109 | 1.6% |
| 5 | 909 | 0.3% |
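The two sets of percentages reported for NPCEP25A use different denominators: the Common Values table divides by all 319952 rows, while the Common Values (Plot) table appears to divide only by the 36748 non-blank rows. The sketch below reproduces both figures from the counts above. It is a plain-Python illustration, not ydata-profiling's own code; the function name `shares` and the use of a single space for the blank code are assumptions.

```python
# Counts copied from the NPCEP25A Common Values table; " " stands for the blank code.
counts = {" ": 283204, "3": 12676, "2": 11744, "4": 6310, "1": 5109, "5": 909}

def shares(counts, drop_blank=False):
    """Return each value's share in percent, optionally ignoring blank codes."""
    kept = {v: n for v, n in counts.items() if not (drop_blank and v.strip() == "")}
    total = sum(kept.values())
    return {v: round(100 * n / total, 1) for v, n in kept.items()}

all_rows = shares(counts)                    # denominator: every row
non_blank = shares(counts, drop_blank=True)  # denominator: non-blank rows only
print(all_rows["5"], non_blank["5"])         # prints: 0.3 2.5
```

Because blank codes are stored as a space character, the report counts them as values rather than as missing cells, which is why Missing stays at 0 for this variable.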
NPCEP27
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 1 | 242829 |
|---|---|
| 2 | 1385 |
| 3 | 464 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 242829 | 75.9% |
| (blank) | 75274 | 23.5% |
| 2 | 1385 | 0.4% |
| 3 | 464 | 0.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 242829 | |
| 2 | 1385 | 0.6% |
| 3 | 464 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 242829 | 75.9% |
| (blank) | 75274 | 23.5% |
| 2 | 1385 | 0.4% |
| 3 | 464 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 244678 | |
| Space Separator | 75274 | 23.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 242829 | |
| 2 | 1385 | 0.6% |
| 3 | 464 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 75274 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 242829 | 75.9% |
| (blank) | 75274 | 23.5% |
| 2 | 1385 | 0.4% |
| 3 | 464 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 242829 | 75.9% |
| (blank) | 75274 | 23.5% |
| 2 | 1385 | 0.4% |
| 3 | 464 | 0.1% |
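The IMBALANCE flag on NPCEP27 can be checked against the counts above. The sketch below assumes the score is one minus the normalized Shannon entropy of the value counts, with the blank code counted as a class. That assumption is not stated in the report, but for these counts it comes out at roughly 58%, in line with the alert listed for this variable. The function name `imbalance` is illustrative.

```python
import math

# Counts copied from the NPCEP27 Common Values table (blank code included).
counts = [242829, 75274, 1385, 464]

def imbalance(counts):
    """One minus normalized Shannon entropy: 0 = balanced, 1 = a single class."""
    if len(counts) < 2:
        return 0.0
    total = sum(counts)
    probs = [n / total for n in counts if n > 0]
    entropy = -sum(p * math.log(p) for p in probs)
    # Dividing by log(number of classes) rescales the entropy to the range [0, 1].
    return 1 - entropy / math.log(len(counts))

score = imbalance(counts)
print(f"{score:.3f}")
```

A perfectly even split, such as `imbalance([100, 100])`, gives 0. The more the rows concentrate in one value, the closer the score gets to 1.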
NPCEP26
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 1 | 134105 |
|---|---|
| 2 | 110461 |
| 3 | 112 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 134105 | 41.9% |
| 2 | 110461 | 34.5% |
| (blank) | 75274 | 23.5% |
| 3 | 112 | < 0.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 134105 | |
| 2 | 110461 | |
| 3 | 112 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 134105 | |
| 2 | 110461 | |
| 75274 | ||
| 3 | 112 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 244678 | |
| Space Separator | 75274 | 23.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 134105 | |
| 2 | 110461 | |
| 3 | 112 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 75274 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 134105 | |
| 2 | 110461 | |
| 75274 | ||
| 3 | 112 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 134105 | |
| 2 | 110461 | |
| 75274 | ||
| 3 | 112 | < 0.1% |
NPCEP5A
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 2 | 169270 |
|---|---|
| 1 | 150682 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 319952 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 169270 | 52.9% |
| 1 | 150682 | 47.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 169270 | |
| 1 | 150682 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 169270 | |
| 1 | 150682 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 319952 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 169270 | |
| 1 | 150682 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 319952 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 169270 | |
| 1 | 150682 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 319952 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 169270 | |
| 1 | 150682 |
FEX_C
Text
| Distinct | 33342 |
|---|---|
| Distinct (%) | 10.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 11.08199 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3545705 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 327 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 16,604442041 |
|---|---|
| 2nd row | 16,604442041 |
| 3rd row | 16,604442041 |
| 4th row | 26,046357048 |
| 5th row | 26,046357048 |
| Value | Count | Frequency (%) |
| 1 | 23508 | 7.3% |
| 4,1739956106 | 1420 | 0.4% |
| 5,8873957278 | 1186 | 0.4% |
| 3,3593691726 | 766 | 0.2% |
| 12,432040251 | 720 | 0.2% |
| 9,2536882129 | 689 | 0.2% |
| 17,68478686 | 686 | 0.2% |
| 24,278983997 | 615 | 0.2% |
| 30,327762873 | 608 | 0.2% |
| 1,9622544989 | 547 | 0.2% |
| Other values (33332) | 289207 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 380078 | |
| 2 | 353237 | |
| 3 | 342151 | |
| 4 | 331896 | |
| 5 | 326012 | |
| 6 | 321031 | |
| 7 | 314468 | |
| 8 | 308028 | |
| 9 | 306766 | |
| , | 296444 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3249261 | |
| Other Punctuation | 296444 | 8.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 380078 | |
| 2 | 353237 | |
| 3 | 342151 | |
| 4 | 331896 | |
| 5 | 326012 | |
| 6 | 321031 | |
| 7 | 314468 | |
| 8 | 308028 | |
| 9 | 306766 | |
| 0 | 265594 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 296444 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3545705 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 380078 | |
| 2 | 353237 | |
| 3 | 342151 | |
| 4 | 331896 | |
| 5 | 326012 | |
| 6 | 321031 | |
| 7 | 314468 | |
| 8 | 308028 | |
| 9 | 306766 | |
| , | 296444 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3545705 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 380078 | |
| 2 | 353237 | |
| 3 | 342151 | |
| 4 | 331896 | |
| 5 | 326012 | |
| 6 | 321031 | |
| 7 | 314468 | |
| 8 | 308028 | |
| 9 | 306766 | |
| , | 296444 |
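FEX_C is profiled as Text rather than Numeric, most likely because its values use a comma as the decimal separator. The character counts are consistent with that reading: the 296444 commas match the 319952 − 23508 rows whose value is not the bare `1`, so each remaining value would carry exactly one comma. The sketch below shows one way to convert such strings to numbers. It assumes the comma is a decimal mark and never a thousands separator, and `parse_decimal_comma` is a hypothetical helper, not part of the report.

```python
from decimal import Decimal

def parse_decimal_comma(text):
    """Parse a number written with a decimal comma, e.g. '16,604442041'."""
    cleaned = text.strip()
    if cleaned == "":
        return None  # treat blank cells as missing rather than guessing a value
    # Assumption: the comma is the decimal mark, never a thousands separator.
    return Decimal(cleaned.replace(",", "."))

# Sample values copied from the FEX_C section above.
samples = ["16,604442041", "26,046357048", "1"]
weights = [parse_decimal_comma(s) for s in samples]
print(weights)
```

When the source file is loaded with pandas, passing `decimal=","` to `read_csv` should have the same effect at read time, so the column would then be profiled as Numeric.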
|  | DIRECTORIO_PER | DIRECTORIO_HOG | DIRECTORIO | SECUENCIA_P | ORDEN | NPCEP4 | NPCEP5 | NPCEP6 | NPCEP7 | NPCEP8 | NPCEP8A | NPCEP9 | NPCEP9A | NPCEP9B | NPCEP10 | NPCEP11A | NPCEP11AA | NPCEP11AB | NPCEP11AC | NPCEP11 | NPCEP13 | NPCEP13A | NPCEP13B | NPCEP13C | NPCEP14 | NPCEP15 | NPCEP16A | NPCEP16B | NPCEP16C | NPCEP16D | NPCEP16E | NPCEP16F | NPCEP16G | NPCEP16H | NPCEP16I | NPCEP16J | NPCEP16K | NPCEP16A1 | NPCEP16AA | NPCEP16AB | NPCEP16B1 | NPCEP17 | NPCEP18 | NPCEP19 | NPCEP21 | NPCEP21A | NPCEP22 | NPCEP22A | NPCEP24 | NPCEP24A | NPCEP25 | NPCEP25A | NPCEP27 | NPCEP26 | NPCEP5A | FEX_C |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 10100011 | 1010001 | 101000 | 1 | 1 | 56 | 1 | 1 | 6 | 1 | 2 | 3 | 15 | 15842 | 2 | 2 | 15 | 15842 | 2 | 2 | 1 | 1 | 6 | 2 | 1 | 3 | 2 | 2 | 1 | 2 | 1 | 16,604442041 | ||||||||||||||||||||||||
| 1 | 10100012 | 1010001 | 101000 | 1 | 2 | 48 | 2 | 2 | 6 | 1 | 1 | 3 | 15 | 15842 | 2 | 1 | 2 | 2 | 1 | 1 | 6 | 2 | 1 | 2 | 2 | 10 | 1 | 1 | 2 | 16,604442041 | ||||||||||||||||||||||||||
| 2 | 10100013 | 1010001 | 101000 | 1 | 3 | 22 | 2 | 3 | 5 | 2 | 1 | 1 | 1 | 1 | 6 | 1 | 1 | 1 | 2 | 1 | 1 | 2 | 16,604442041 | |||||||||||||||||||||||||||||||||
| 3 | 10100111 | 1010011 | 101001 | 1 | 1 | 42 | 1 | 1 | 6 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 6 | 2 | 1 | 4 | 2 | 3 | 4 | 1 | 2 | 1 | 26,046357048 | |||||||||||||||||||||||||||||
| 4 | 10100112 | 1010011 | 101001 | 1 | 2 | 43 | 2 | 2 | 6 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 6 | 3 | 1 | 1 | 2 | 1 | 3 | 1 | 1 | 2 | 26,046357048 | |||||||||||||||||||||||||||||
| 5 | 10100113 | 1010011 | 101001 | 1 | 3 | 20 | 2 | 3 | 5 | 2 | 1 | 1 | 1 | 1 | 6 | 1 | 1 | 1 | 2 | 1 | 1 | 2 | 26,046357048 | |||||||||||||||||||||||||||||||||
| 6 | 10100114 | 1010011 | 101001 | 1 | 4 | 1 | 2 | 3 | 2 | 1 | 1 | 1 | 6 | 1 | 1 | 1 | 2 | 2 | 26,046357048 | |||||||||||||||||||||||||||||||||||||
| 7 | 10100211 | 1010021 | 101002 | 1 | 1 | 65 | 2 | 1 | 3 | 3 | 15 | 15244 | 1 | 1 | 2 | 2 | 1 | 1 | 6 | 3 | 1 | 2 | 3 | 1 | 2 | 1 | 1 | 2 | 13,840826089 | |||||||||||||||||||||||||||
| 8 | 10100212 | 1010021 | 101002 | 1 | 2 | 33 | 1 | 3 | 5 | 2 | 1 | 1 | 1 | 1 | 6 | 3 | 6 | 1 | 1 | 1 | 2 | 1 | 13,840826089 | |||||||||||||||||||||||||||||||||
| 9 | 10100311 | 1010031 | 101003 | 1 | 1 | 64 | 1 | 1 | 6 | 1 | 2 | 3 | 15 | 15660 | 2 | 2 | 15 | 15660 | 2 | 2 | 2 | 1 | 6 | 3 | 10 | 3 | 10 | 1 | 2 | 1 | 7,0111108805 |
|  | DIRECTORIO_PER | DIRECTORIO_HOG | DIRECTORIO | SECUENCIA_P | ORDEN | NPCEP4 | NPCEP5 | NPCEP6 | NPCEP7 | NPCEP8 | NPCEP8A | NPCEP9 | NPCEP9A | NPCEP9B | NPCEP10 | NPCEP11A | NPCEP11AA | NPCEP11AB | NPCEP11AC | NPCEP11 | NPCEP13 | NPCEP13A | NPCEP13B | NPCEP13C | NPCEP14 | NPCEP15 | NPCEP16A | NPCEP16B | NPCEP16C | NPCEP16D | NPCEP16E | NPCEP16F | NPCEP16G | NPCEP16H | NPCEP16I | NPCEP16J | NPCEP16K | NPCEP16A1 | NPCEP16AA | NPCEP16AB | NPCEP16B1 | NPCEP17 | NPCEP18 | NPCEP19 | NPCEP21 | NPCEP21A | NPCEP22 | NPCEP22A | NPCEP24 | NPCEP24A | NPCEP25 | NPCEP25A | NPCEP27 | NPCEP26 | NPCEP5A | FEX_C |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 319942 | 31783513 | 3178351 | 317835 | 1 | 3 | 22 | 1 | 3 | 5 | 2 | 1 | 1 | 1 | 1 | 6 | 1 | 1 | 1 | 2 | 1 | 2 | 1 | 1 | |||||||||||||||||||||||||||||||||
| 319943 | 31783514 | 3178351 | 317835 | 1 | 4 | 18 | 1 | 3 | 5 | 2 | 2 | 1 | 1 | 1 | 6 | 1 | 1 | 1 | 2 | 1 | 2 | 1 | 1 | |||||||||||||||||||||||||||||||||
| 319944 | 31784411 | 3178441 | 317844 | 1 | 1 | 39 | 1 | 1 | 2 | 1 | 2 | 2 | 2 | 1 | 1 | 1 | 6 | 2 | 10 | 2 | 10 | 1 | 2 | 1 | 1 | |||||||||||||||||||||||||||||||
| 319945 | 31784412 | 3178441 | 317844 | 1 | 2 | 39 | 2 | 2 | 2 | 1 | 1 | 2 | 2 | 1 | 1 | 1 | 6 | 2 | 10 | 2 | 10 | 1 | 1 | 2 | 1 | |||||||||||||||||||||||||||||||
| 319946 | 31785911 | 3178591 | 317859 | 1 | 1 | 59 | 2 | 1 | 3 | 2 | 1 | 2 | 15 | 15087 | 1 | 1 | 6 | 3 | 1 | 1 | 3 | 1 | 1 | 1 | 1 | 2 | 1 | |||||||||||||||||||||||||||||
| 319947 | 31787411 | 3178741 | 317874 | 1 | 1 | 25 | 1 | 1 | 5 | 2 | 1 | 1 | 1 | 1 | 6 | 2 | 2 | 2 | 2 | 1 | 2 | 1 | 1 | |||||||||||||||||||||||||||||||||
| 319948 | 31788111 | 3178811 | 317881 | 1 | 1 | 32 | 1 | 1 | 5 | 2 | 2 | 1 | 1 | 1 | 6 | 3 | 10 | 1 | 2 | 1 | 1 | 1 | 1 | |||||||||||||||||||||||||||||||||
| 319949 | 31788112 | 3178811 | 317881 | 1 | 2 | 60 | 2 | 5 | 3 | 2 | 2 | 1 | 1 | 1 | 6 | 3 | 10 | 3 | 10 | 1 | 1 | 2 | 1 | |||||||||||||||||||||||||||||||||
| 319950 | 31788511 | 3178851 | 317885 | 1 | 1 | 51 | 1 | 1 | 2 | 1 | 2 | 2 | 2 | 1 | 1 | 1 | 6 | 2 | 10 | 2 | 10 | 1 | 2 | 1 | 1 | |||||||||||||||||||||||||||||||
| 319951 | 31788512 | 3178851 | 317885 | 1 | 2 | 47 | 2 | 2 | 2 | 1 | 1 | 2 | 2 | 1 | 1 | 1 | 6 | 2 | 10 | 2 | 10 | 1 | 1 | 2 | 1 |